Logarithmic Equal-Letter Runs for BWT of Purely Morphic Words


Abstract

In this paper we study the number $$r_{\texttt {bwt}}$$ of equal-letter runs produced by the Burrows-Wheeler transform (BWT) when it is applied to purely morphic finite words, which are words generated by iterating prolongable morphisms. Such a parameter $$r_{\texttt {bwt}}$$ is very significant since it provides a measure of the performances of the BWT, in terms of both compressibility and indexing. In particular, we prove that, when BWT is applied to whichever purely morphic finite word on a binary alphabet, $$r_{\texttt {bwt}}$$ is $$\mathcal {O}(\log n)$$ , where n is the length of the word. Moreover, we prove that $$r_{\texttt {bwt}}$$ is $$\varTheta (\log n)$$ for the binary words generated by a large class of prolongable binary morphisms. These bounds are proved by providing some new structural properties of the bispecial circular factors of such words.
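As a small illustration of the quantity studied in the abstract, the following Python sketch iterates a prolongable morphism, computes the BWT of the resulting finite word, and counts the equal-letter runs. It assumes the BWT is taken over sorted rotations of the word with no end-of-string marker. The function names and the two example morphisms (Fibonacci and Thue-Morse) are illustrative choices and are not taken from the paper. Sorting all rotations takes quadratic space, so this is only suitable for short words.

```python
def iterate_morphism(morphism, letter, steps):
    """Apply the morphism `steps` times, starting from the single letter `letter`."""
    word = letter
    for _ in range(steps):
        word = "".join(morphism[c] for c in word)
    return word


def bwt(word):
    """BWT via sorted rotations (assumption: no sentinel is appended)."""
    rotations = sorted(word[i:] + word[:i] for i in range(len(word)))
    return "".join(rotation[-1] for rotation in rotations)


def count_runs(text):
    """Number of maximal blocks of equal consecutive letters."""
    return sum(1 for i, c in enumerate(text) if i == 0 or c != text[i - 1])


# Two prolongable binary morphisms, chosen here only as examples.
FIBONACCI = {"a": "ab", "b": "a"}
THUE_MORSE = {"a": "ab", "b": "ba"}

if __name__ == "__main__":
    for name, morphism in (("Fibonacci", FIBONACCI), ("Thue-Morse", THUE_MORSE)):
        for k in range(1, 9):
            w = iterate_morphism(morphism, "a", k)
            print(name, "k =", k, "n =", len(w), "runs =", count_runs(bwt(w)))
```

Running the script prints, for each iterate, the word length n next to the run count, so the growth of the run count can be compared against log n by inspection.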


Similar resources

Extremal words in morphic subshifts

Given an infinite word x over an alphabet A, a letter b occurring in x, and a total order σ on A, we call the smallest word with respect to σ starting with b in the shift orbit closure of x an extremal word of x. In this paper we consider the extremal words of morphic words. If x = g(f^ω(a)) for some morphisms f and g, we give two simple conditions on f and g that guarantee that all extremal wo...


Logarithmic density in morphic sequences

We study the logarithmic frequency of letters and words in morphic sequences and show that this frequency must always exist, answering a question of Allouche and Shallit.


Optimal-Time Text Indexing in BWT-runs Bounded Space

Indexing highly repetitive texts, such as genomic databases, software repositories and versioned text collections, has become an important problem since the turn of the millennium. A relevant compressibility measure for repetitive texts is r, the number of runs in their Burrows-Wheeler Transform (BWT). One of the earliest indexes for repetitive collections, the Run-Length FM-index, used O(r) s...


Almost Rich Words as Morphic Images of Rich Words

We focus on Θ-rich and almost Θ-rich words over a finite alphabet A, where Θ is an involutive antimorphism over A∗. We show that any recurrent almost Θ-rich word u is an image of a recurrent Θ′-rich word under a suitable morphism, where Θ′ is also an involutive antimorphism. Moreover, if the word u is uniformly recurrent, we show that Θ′ can be set to the reversal mapping. We also treat one spe...


On the Periodicity of Morphic Words

Given a morphism h prolongable on a and an integer p, we present an algorithm that calculates which letters occur infinitely often in congruent positions modulo p in the infinite word h^ω(a). As a corollary, we show that it is decidable whether a morphic word is ultimately p-periodic. Moreover, using our algorithm we can find the smallest similarity relation such that the morphic word is ultimate...



Journal

Journal title: Lecture Notes in Computer Science

Year: 2022

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-05578-2_11